Breast Cancer Prognostics Using Multi-Omics Data

نویسندگان

  • Sisi Ma
  • Jiwen Ren
  • David Fenyö
چکیده

Breast cancer affects one in eight women in America and is a leading cause of death from cancer worldwide. In the current study, four types of Omics data including copy number variation, gene expression, proteome and phosphoproteome were collected from seventy-seven breast cancer patients. Individual types of Omics data were used to separately construct predictive models to predict ten-year survival, an important clinical hallmark. The predictive models constructed with proteome data achieved decent predictivity (mean AUC = 0.725) and outperforms the models constructed with other types of Omics data. This indicates that high quality, large scale protein data is more effective for survival prediction compared to other types of omics data. Further, we experimented with ten different data fusion techniques (generic and Multi-kernel learning based) to test whether combining multi-Omics data can result in improved predictive performance. None of the data fusion techniques tested in the current study outperforms the predictive models built with the proteome data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pathway-level Integration of Proteogenomic Data in Breast Cancer Using Independent Component Analysis

Recent advances in the multi-omics characterization necessitate pathway-level abstraction and knowledge integration across different data types. In this study, we apply independent component analysis (ICA) to human breast cancer proteogenomics data to retrieve mechanistic information. We show that as an unsupervised feature extraction method, ICA was able to construct signatures with known biol...

متن کامل

LinkedOmics: analyzing multi-omics data within and across 32 cancer types

The LinkedOmics database contains multi-omics data and clinical data for 32 cancer types and a total of 11 158 patients from The Cancer Genome Atlas (TCGA) project. It is also the first multi-omics database that integrates mass spectrometry (MS)-based global proteomics data generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) on selected TCGA tumor samples. In total, LinkedOmic...

متن کامل

Omics Pipe: a community-based framework for reproducible multi-omics data analysis

MOTIVATION Omics Pipe (http://sulab.scripps.edu/omicspipe) is a computational framework that automates multi-omics data analysis pipelines on high performance compute clusters and in the cloud. It supports best practice published pipelines for RNA-seq, miRNA-seq, Exome-seq, Whole-Genome sequencing, ChIP-seq analyses and automatic processing of data from The Cancer Genome Atlas (TCGA). Omics Pip...

متن کامل

Integrating Multi-Omics for Uncovering the Architecture of Cross-Talking Pathways in Breast Cancer

Cross-talk among abnormal pathways widely occurs in human cancer and generally leads to insensitivity to cancer treatment. Moreover, alterations in the abnormal pathways are not limited to single molecular level. Therefore, we proposed a strategy that integrates a large number of biological sources at multiple levels for systematic identification of cross-talk among risk pathways in cancer by r...

متن کامل

Inflammatory Breast Neoplasms: a Systematic Review

Overview: Inflammatory Breast Cancer (IBC) is a rare and very aggressive type of cancer that tends to develop at a younger age, compared with other subtypes of breast cancer. Because a distinct lump may not be noticeable, correct diagnosis takes longer and, therefore, successful treatment may hinder a patient’s prognostics. This study aims to conduct a systematic review of research articles on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2016  شماره 

صفحات  -

تاریخ انتشار 2016